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ABSTRACT 

We present new tests for disruption mechanisms of star clusters based on the bivariate mass- 
age distribution g(M, t). In particular, we derive formulae for g(M, t) for two idealized models 
in which the rate of disruption depends on the masses of the clusters and one in which it does 
not. We then compare these models with our Hubble Space Telescope observations of star clusters 
in the Antennae galaxies over the mass-age domain in which we can readily distinguish clusters 
from individual stars: r < lO^(M/lO^Af0)^-^ yr. We find that the models with mass-dependent 
disruption are poor fits to the data, even with complete freedom to adjust several parameters, 
while the model with mass-mdependent disruption is a good fit. The successful model has the 
simple form g{M,T) oc Af^^r^^, with power-law mass and age distributions, dN/dM oc Af~^ 
and dN/dr cx t~^. The predicted luminosity function is also a power law, dN/dL oc L^^, in good 
agreement with our observations of the Antennae clusters. The similarity of the mass functions 
of star clusters and molecular clouds indicates that the efficiency of star formation in the clouds 
is roughly independent of their masses. The age distribution of the massive young clusters 
is plausibly explained by the following combination of disruption mechanisms: (1) removal of 
interstellar material by stellar feedback, r < 10^ yr; (2) continued stellar mass loss, lO^yr < r < 
lO^yr; (3), tidal disturbances by passing molecular clouds, r > 10^ yr. None of these processes is 
expected to have a strong dependence on mass, consistent with our observations of the Antennae 
clusters. We speculate that this simple picture also applies — at least approximately — to the 
clusters in many other galaxies. 

Subject headings: galaxies: individual (NGC 4038, NGC 4039) — galaxies: interactions — galaxies: star 
clusters — stars: formation 



1. INTRODUCTION 

Star clusters are born in molecular clouds and 
are then dispersed into the surrounding stellar 
field by a variety of processes operating on differ- 
ent timescales, including the removal of interstel- 
lar material by stellar feedback, continued stellar 
mass loss, tidal disturbances by passing molecu- 
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lar clouds, gravitational shocks during rapid pas- 
sages near the galactic bulge or through the galac- 
tic disk, orbital decay into the galactic center 
caused by dynamical friction, and stellar escape 
driven by internal two-body relaxation. Signa- 
tures of these processes are encoded in the mass 
function, ■ip{M) cx dN/dM, and the age distribu- 
tion, x{^) dN/dr, for a population of clusters 
in a particular galaxy. A more informative, but 
less familiar statistic is the bivariate distribution 
of masses and ages, g(M, t) oc d^N/dMdr. The 
key feature of g{M,T) is that it includes correla- 
tions between M and r, information that is ab- 
sent from the univariate distributions ^{M) and 
x(t). Thus, with the help of g(M,T), we can an- 
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swer questions such as whether low-mass clusters 
are disrupted faster than, or at the same rate as, 
high-mass clusters. In this paper, we derive for- 
mulae for g{M, t) for the first time for several dif- 
ferent models for the disruption of clusters, and 
we then compare these formulae with our Hub- 
hie Space Telescope [HST) observations of clusters 
in the Antennae galaxies. In a companion paper, 
we present similar comparisons for the clusters in 
the Large and Small Magellanic Clouds (LMC and 
SMC; Chandar et al. 2009, hereafter CFW09). 

We have focused on the star clusters in the An- 
tennae galaxies for several reasons. First, the pop- 
ulation of clusters is large, N k, 2300 brighter 
than Mv = —9, ample for statistical studies. 
Second, many of the bright young clusters have 
masses in the range of old globular clusters, lO'' < 
M/Mq < 10^, suggesting that the former may 
simply be younger versions of the latter. Third, 
because the Antennae are currently in the throes 
of a major merger, they give us a close-up, inter- 
nal view of events and processes that were more 
common in galaxies in the past, during their hi- 
erarchical formation. Fourth, the Antennae have 
been mapped at almost every wavelength available 
to astronomers, from radio to X-rays, providing a 
more detailed picture of their stellar and inter- 
stellar contents than for almost any other galaxy 
outside the Local Group. 

In our previous studies of the clusters in the 
Antennae, we found that the mass and age dis- 
tributions could be approximated by power laws: 
V'(M) oc M/^ with /3 « -2 (Zhang & FaU 1999, 
hereafter ZF99) and x(t") oc with 7 « -1 (Fall 
et al. 2005, hereafter FCW05). In these stud- 
ies, we also found that the shapes of ip{M) and 
x(t) are approximately independent of the age 
and mass limits, respectively, of the samples used 
to determine them. These findings suggest that 
the bivariate mass-age distribution can be approx- 
imated simply by the product of the univariate 
distributions: 

g{M, r) oc V'(M)x(t) oc M'^t'^ (1) 

for T < 10^(Af/10''Mo)i-3yr. (2) 

The second expression above indicates the approx- 
imate domain of validity of the first, the condition 
that objects in the sample be brighter than the 



most luminous individual stars {Ly > 3x 1O^L0)0 
In this model, clusters form with a power-law ini- 
tial mass function and are then disrupted at rates 
that are independent of their masses. We have al- 
ready explored some of the consequences of equa- 
tion (1) in two recent papers (Fall 2006; Whitmore 
et al. 2007, hereafter WCF07). In this paper, we 
make further tests of this model, and we also com- 
pare our HST observations of the Antennae clus- 
ters with two models in which clusters of different 
masses are disrupted at different rates. 

The age distribution for a population of star 
clusters reflects, in principle, a combination of 
both formation and disruption rates. We inter- 
pret the decline of xi''') primarily as the result of 
disruption rather than formation for the following 
reasons. First, the age distribution has a sharp 
peak at the present time, t = 0, and a small width, 
characterized by the median age r ~ lO'' yr. This 
requires either rapid disruption at an arbitrary 
time (now) or rapid formation at a special time 
(now), the former being much more likely a priori 
than the latter. Of course, the Antennae galax- 
ies are presently merging, which has almost cer- 
tainly boosted the formation rates of stars and 
clusters, but this occurs more slowly, on the or- 
bital timescale of the galaxies, t ~ few x 10^ yr. 
In the simulations of merging galaxies by Mihos et 
al. (1993), including the Antennae, for example, 
the star formation rate varies by factors of only a 
few or less in the past 10'* yr, whereas the observed 
age distribution of the clusters drops by nearly 2 
orders of magnitude in this interval of time. The 
second reason we interpret the decline of x(t) in 
terms of disruption is that it has the same shape, 
including the sharp peak at r = 0, in large re- 
gions separated by distances of order 10 kpc within 
the Antennae galaxies (Whitmore 2004; WCF07). 
There is no physical mechanism that could syn- 
chronize a burst of cluster formation this precisely 
at these separations; the communication time is 
T ~ 10^ yr for a signal traveling at the typical 
random velocity in the interstellar medium (ISM; 
w ~ 10 km s~^) and r ~ 10® yr for one travel- 
ing at the highest bulk velocity {v ~ 100 km s~^). 
Thus, we conclude that the observed decline in the 
age distribution is caused mainly by disruption, at 



^Equation (2) follows from the fact that the mass-to-light 
ratios of clusters vary with age approximately as M/Ly oc 
^0.8 foj. ^ > iq7 yj. 
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least for r < few x 10^ yr. 

We have speculated that equation (1) for 
g{M, t) may also describe — approximately — the 
cluster populations in other galaxies (Fall 2006; 
WCF07). If this picture turns out to be gener- 
ally valid, it will mean that star clusters form and 
evolve in much the same way in different galaxies, 
the primary variable being an overall scale fac- 
tor proportional to the total number of clusters 
and hence to their average formation rate. We 
define a cluster here to be any concentrated ag- 
gregate of stars with a density much higher than 
that of the surrounding stellar field, whether or 
not it is gravitationally bound, since this is vir- 
tually impossible to determine from the available 
observations, especially for clusters younger than 
about 10 internal crossing times. Most, if not all, 
stars form in clusters (as just defined) and most 
clusters then dissolve into the; stellar field. We 
find it intriguing that this whole complex process 
might be represented approximately by a simple 
"recipe" such as equation (1). 

There are a few other hints that point toward 
this model: (1) The same mass and age distribu- 
tions, tp{M) oc and %(''') c< t~^, are found 
for clusters with M < lO^M© and r < 3 x 10^ yr in 
the solar neighborhood (Lada & Lada 2003). (2) 
The luminosities of the brightest clusters in dif- 
ferent galaxies scale approximately with the size 
of the population in the way expected from equa- 
tion (1) (Larsen 2002; Whitmore 2003; WCF07). 
(3) The mass spectrum of molecular clouds, from 
which the clusters form, appears to be similar in 
different galaxies (Blitz et al. 2007). (4) The most 
important early disnipticm processes, removal of 
ISM by stellar feedback and continued stellar mass 
loss, arc; independent of the properties of the host 
galaxy (except possibly its metallicity and stel- 
lar initial mass function (IMF)). Of course, a 
definitive conclusion about the generality of equa- 
tion (1) will require more detailed studies, such as 
that presented here for the Antennae, of the clus- 
ter populations in several other galaxies. We have 
recently made a start on this with the LMC and 
SMC (CFW09). Our main purpose in this paper 
is to present some of the interpretive tools needed 
for such studies. 

The plan for the remainder of this paper is the 
following: In §2, we present analytic formulae for 
g{M, r) for two models with mass-dependent dis- 



ruption and the model already mentioned with 
mass-independent disruption. In §3, we describe 
our HST observations of the Antennae clusters 
and compare these with the models presented in 
the previous section. We also derive the luminos- 
ity function of the Antennae clusters and show 
how this is related to the mass function through 
g{M,T). In §4, we discuss the physical processes 
most likely responsible for the formation and dis- 
ruption of clusters and how these processes affect 
the mass and age distributions. This interpreta- 
tive section presents our current understanding of 
the entire life cycles of star clusters. In §5, we 
summarize our main results and their broader im- 
plications. The paper also includes two appen- 
dices. The first (A) discusses some related work 
on the mass and luminosity functions of the An- 
tennae clusters by Mengel et al. (2005) and by 
Anders et al. (2007) respectively, while the second 
(B) presents some specialized formulae needed in 
§3. 

2. MODELS 

The basic element of a statistical description 
of a population of star clusters is the multivari- 
ate distribution of all the important properties of 
the clusters, such as their luminosities, masses, 
ages, effective radii, internal concentrations, stel- 
lar contents, and so forth. For a population of 
clusters more distant than ~ 10 Mpc, however, 
many clusters are poorly resolved or not resolved 
at all. and the list of available properties shrinks 
to just three: luminosities, masses, and ages, of 
which only two are independent if the stellar IMF 
is assumed to be universal, as it almost always is 
in practice. With these thoughts in mind, we focus 
here on the bivariate distribution g{M, t), defined 
such that g{M, T)dMdT is the number of clusters 
with masses between M and M -\- dM and ages 
between r and t + dr. This distribution can be 
estimated from fits of stellar population models to 
photometry in several bands of the clusters in a 
large sample, as we describe in the next section. 

The goal of this section is to relate g{M,T) to 
information about the formation and disruption of 
the clusters. This is done most easily with the help 
of a few auxiliary functions. We define another 
distribution /{Mq^t) such that f(Mi:),T)dModT is 
the number of clusters born with initial masses be- 
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twccn Mq and Afo + dMo at times in the past be- 
tween T and T + dr. Furthermore, we assume that 
the current mass M{Mo,t) of a cluster with an 
initial mass Mq and the inverse relation Afo(M, t) 
are known functions of the age r. Then / and g 
are related by the continuity equation in the form 

g{M,T) = f[Mo{M,T),T]{dMo/dM)^. (3) 

This generalizes equation (2) of Fall & Zhang 
(2001) from an instantaneous burst to an arbi- 
trary history of cluster formation. In equation (3) 
here, all information about formation is encoded 
in / (Mq, r), while all information about disruption 
is encoded in M{Mq,t) and hence in the differ- 
ence between / and g. In fact, these distributions 
are equal only if clusters are never disrupted (i.e., 
M = Mq for all r). 

We can exploit equation (3) in either of two 
ways. If we knew everything about the disruption 
mechanisms (likely more than one) and hence the 
precise form of M{Mq, t), we could use the empir- 
ical determination of g(Af, r) together with equa- 
tion (3) to infer /(Mo,t). In practice, however, 
we must work in the other direction: make reason- 
able assumptions about /{Mq, t), adopt a specific 
model for M{Mq,t), compute g{M,T) from equa- 
tion (3), and compare this with the corresponding 
empirical distribution to see if the model is accept- 
able or must be rejected. In this spirit, we assume 
that clusters form with the following initial mass- 
age distribution: 



/(Mo,r) = C(r)(Mo/M*) 



/3 



(4) 



Here C(t), with imits of [mass]"'^ [time]"'^, speci- 
fies the formation rate of clusters at an age r, while 
M* is a fiducial mass, usually taken to be 10^ Mq. 
Our assumption that the initial mass function is 
a power law is justified by our previous results 
and those presented in the next section of this pa- 
per. In all our detailed comparisons with observa- 
tions, we assume for simplicity that the formation 
rate C(t) in equation (4) is constant. For the rea- 
sons discussed in the Introduction, this should be 
a good approximation for the Antennae clusters 
younger than a few x 10^ yr. It is worth noting 
here, however, that the mathematical formulae for 
g{M,T) presented in this section [equations (6), 
(12), and (13) below] are equally valid with a vari- 
able formation rate, even one as extreme as an 



instantaneous burst, with C(t) cx S{t — tq). We 
also note that, although f{Mo,T) is a separable 
function of Mq and r (by assumption), g{M,T) 
will generally include correlations between M and 
r, except in the special case in which M/Mq is a 
function only of r. 

In the remainder of this section, we present the 
mass-age distribution g{M, r) for three simple, il- 
lustrative models for the disruption of clusters. In 
this context, it is important to bear in mind that 
several processes probably operate together to dis- 
rupt the clusters, with different combinations pre- 
dominating at different ages over the lifetimes of 
the clusters. Thus, we cannot expect the most suc- 
cessful model to display the signature of any single 
process operating in isolation. The massive young 
clusters in our sample are probably disrupted first 
by the removal of residual interstellar material 
driven by early stellar feedback (photoionization, 
radiation pressure, stellar winds, and supernovae), 
then by continued mass loss from the stars them- 
selves through stellar winds and other ejecta, and 
then possibly by the escape of stars driven by tidal 
disturbances of passing structures such as molec- 
ular clouds and/or spiral arms. The three models 
we consider have simple functional forms but sev- 
eral adjustable parameters to accommodate this 
likely complexity. We note here that the clusters 
in our sample are all too massive and too young 
to have lost a significant fraction of their mass as 
the result of stellar escape driven by internal two- 
body relaxation, the primary long-term disruptive 
process for clusters and one that certainly depends 
on mass (see §4.3 below). 

Model 1: Sudden Mass- Dependent Disruption. 
In this case, we assume clusters retain all of their 
initial mass until they are destroyed suddenly and 
completely at an age Td{Mo): 



M{t) 



Mo for T < Td(Mo) 







otherwise. 



(5) 



The corresponding mass-age distribution, from 
equations (3), (4), and (5), is 



g{M,T) 



C{M/M,f ioxT<Td{M) 







otherwise. 



(6) 



These equations are valid for any dependence of 
Td(Mo) on Mq. Boutloukos & Lamers (2003) have 
proposed a specific version of this model with a 
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power-law dependence on mass: 

TdiMo)^n{Mo/A'QK 



(7) 



The discontinuous evolution posited by this model 
is clearly an oversimplification. Boutloukos & 
Lamers (2003) claim that Model 1 does, neverthe- 
less, reveal the correct mass dependence of disrup- 
tion when it is compared with observations. They 
find the same exponent k w 0.6 but very differ- 
ent disruption timescales for the cluster pop- 
ulations in several galaxies. We are skeptical of 
these claims for reasons we explain in detail in §3 
and §4 (see also WCF07). 

Model 2: Gradual Mass- Dependent Disruption. 
In this case, we assume clusters lose mass gradu- 
ally according to the equations 



dM/dr = -M/Td{M), 
Td{M)^T,{M/Ahf. 



(8) 
(9) 



The similarity between the disruption timescales 
TdiMfj) and Td{M) in equations (7) and (9) makes 
Model 2 an analog of Model 1 but with continuous 
evolution (as noted also by Boutloukos & Lamers 
2003). It seems reasonable to suppose that the 
gradual evolution posited by Model 2 would make 
it more realistic and hence a better match to the 
observations than Model 1. Indeed, Model 2 has 
known physical justifications in two special cases, 
both of which apply to tidally limited clusters 
(with constant mean density). For = 0, it de- 
scribes the disruption of clusters by external grav- 
itational shocks, while for fc = 1, it describes the 
disruption of clusters by internal two-body relax- 
ation (Spitzer 1987). The first of these is poten- 
tially relevant to intermediate-age clusters, while 
the second is certainly relevant to old clusters, as 
we discuss in detail in §4. 

The solutions of equations (8) and (9) are 



M(r) 



Mo [l-A:(A/o/M,)-'=(T/n )]'/'' 

for k^O 
Afoexp(-T/T*) 

for A: = 0. 



(10) 

Thus, the derivative required in equation (3) is 

{dMo/dM)r = (Mo/My-'' for aU k. (11) 



Combining equations (3), (4), (10), and (11), we 
obtain the mass-age distribution 



for fc ^ 
C{M/AQf> e^p[if3 + l)iT/T,)] 
for fc = 0. 

(12) 

Several features of this equation are noteworthy. 
For fixed r and fc > 0, g{M,T) is a double power 
law with a low-M exponent fc — 1, a gradual bend 
at the mass given by r « Td{M), and a high-M 
exponent /3. For fixed M and k ^ 0, g{M,T) is 
constant for small r, has a knee at r « Td{M), 
and is a power law with an exponent (/? + 1 — fc)/fc 
for large r. However, for fc = 0, g{M^ r) is a single 
power law in M with an exponent j3 for fixed r, 
and declines exponentially in t with a timescale 
—T^/{(3 -f 1) (i.e., faster than any power law) for 
fixed M. 

Model 3: Gradual Mass-Independent Disrup- 
tion. In this case, we adopt the mass-age dis- 
tribution indicated by our earlier studies (ZF99, 
FCW05, Fall 2006, WCF07), namely 



g{M,T)=C{M/AUf{TlT,)\ 



(13) 



This distribution is manifestly separable: g{M,T) oc 
V'(M)x(t), with ip{M) cx {M/M^Y (independent 
of r) and %(''') oc (r/r,)'*' (independent of M). It 
has three independent, adjustable parameters: C, 
/3, and 7. The other parameters, M* and t* (which 
are not independent of C, /3, and 7), merely spec- 
ify the units in which masses and ages are mea- 
suredH We can derive equation (13) from equa- 
tions (3) and (4) with M(t) = Mq{t/t^)-''''^'^+^\ 
In this case, the masses of all clusters decline grad- 
ually, with the same power law in age. However, 
this interpretation of equation (13) is not unique. 
Another possibility is that g(M, r) specifies the 
probability that clusters of mass M survive to 
an age r, even if disruption, when it happens, is 
sudden. The form of 5(M, r) displayed in equa- 
tion (13) then implies that the survival probability 
is independent of mass and declines as a power law 
in age. Intermediate cases are also possible, with 
different combinations of age-dependent masses 



^Note that the parameter t» plays different roles in the three 
models; for Models 1 and 2, it specifies the characteris- 
tic disruption timescale for clusters of mass Af, , while for 
Model 3, it enters only as an arbitrary normalization factor. 
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and survival probabilities. Moreover, some clus- 
ters may lose only part of their mass (generally in 
the form of the least bound stars), while others 
are completely destroyed. We refer to all situa- 
tions described by equation (13) as gradual mass- 
independent disruption, irrespective of whether 
the disruption of individual clusters is gradual or 
sudden, partial or complete, because it leads to a 
gradual decline in the number of clusters of each 
mass with age at a fractional rate independent of 
massH 

The main justification for Model 3 is that it 
provides a simple, approximate representation of 
g{M^ t) for the massive young clusters in the An- 
tennae galaxies (in the mass-age domain specified 
by eqn. [2] above). It is likely that the power- 
law dependence of g{M, r) on M reflects the mass 
function of the molecular clouds in which the clus- 
ters formed, and an efficiency of star formation 
that is nearly independent of the masses of the 
clouds. The subsequent disruption of many of the 
young clusters by the removal of their ISM, stel- 
lar mass loss, and tidal disturbances also tends to 
preserve the power-law shape of the mass function. 
These effects plausibly predominate in the follow- 
ing sequence: (1) removal of ISM, r < 10^ yr; 
(2) stellar mass loss, lO^yr < r < lO^yr; (3) tidal 
disturbances, r > 10* yr. This combination of 
disruptive processes might then account (approx- 
imately) for the power-law dependence of g{M,T) 
on T. Variations in the formation rate could also 
play some role, although we expect this to be mi- 
nor, for the reasons discussed in the Introduction. 
We discuss these issues more fully in §4. 

3. OBSERVATIONS AND COMPAR- 
ISONS WITH MODELS 

The goal of this section is to compare the ob- 
served mass and age distributions of star clusters 
in the Antennae with predictions from the three 
disruption models developed in the previous sec- 
tion. In the next section, we discuss the physical 
implications of our results for the formation and 
disruption of the clusters, and in a companion pa- 
per we make similar comparisons between mod- 
els and data for clusters in the Magellanic Clouds 
(CFW09). 



When we mention rates in this paper, we usually mean 
fractional rates, i.e., d{lnX)/dt rather than dX/dt. 



3.1. Observations and Input Data 

We have derived the masses and ages of the 
Antennae clusters from UBVItla photometry of 
images taken with the WFPC2 on HST. These 
images cover nearly all of the main bodies of the 
Antennae galaxies, omitting only a small region in 
the northwest and the long tidal tails. Thus, all 
the results presented in this paper are essentially 
averages over the two merging galaxies. We de- 
scribe our procedure in detail in FCW05, includ- 
ing extensive tests of its validity, and the resulting 
age distributions for mass- and luminosity-limited 
samples of clusters. Here, we give only a brief sum- 
mary of our procedure but a more complete set 
of results, including the bivariate mass-age distri- 
bution and the univariate mass and age distribu- 
tions. The present work also updates the earlier 
results from ZF99 based on UBVI photometry of 
the same images, but with masses and ages esti- 
mated from two reddening-free Q parameters. 

In the present study, we determine the age r 
and extinction Ay of each object by performing 
a minimum fit between the observed UBVIHa 
magnitudes and those predicted by stellar popula- 
tion models. We use the Bruzual & Chariot (2003) 
models with solar metallicity, a Salpeter (1955) 
IMF with lower and upper cutoffs at 0.1 Mq and 
100 Mq, and the Calactic-type extinction law 
from Fitzpatrick (1999). Our procedure, which 
uses measurements in five filters (four colors) si- 
multaneously to estimate two parameters (r and 
Av) for each cluster, is thus over-constrained 
mathematically. The Ha flux varies rapidly with 
age near « 10^ yr, and is particularly useful 
in distinguishing clusters younger and older than 
this. We estimate the masses of the clusters from 
their total V^-band luminosity (corrected for ex- 
tinction) and the age-dependent mass-to-light ra- 
tio (M/Lv) from the stellar population models, 
assuming a distance of 19.2 Mpc to the Antennae. 
Figure 1 shows the resulting luminosity-age dis- 
tribution for all objects, and Figure 2 shows the 
equivalent mass-age distribution. If, instead of the 
Salpeter (1955) IMF, we had adopted the Kroupa 
(2001) or Chabrier (2003) IMF, all the masses 
would be reduced by about 40% while all the ages 
would remain virtually the same (see Fig. 1 of 
CFW09). 

The objects plotted in Figures 1 and 2 include 
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both individual stars and compact star clusters. 
If the Antennae were closer we could use struc- 
tural parameters alone to select the clusters, but 
not all of them are spatially resolved in the HST 
images (i.e. broader than the point-spread func- 
tion), and differentiating clusters from individual 
stars becomes increasingly difficult at faint mag- 
nitudes and in crowded regions. Therefore, fol- 
lowing the standard technique (e.g., Whitmore et 
al. 1999), we minimize contamination by restrict- 
ing our sample to objects brighter than all but 
the most luminous stars {Ly > 3 x IO^Lq or 
Mv < ~9)i resulting in stellar contamination at 
the < 5% level (ZF99), and giving a final sam- 
ple of w 2300 clusters. The solid lines in Fig- 
ures 1 and 2 show our cluster selection limit of 
My = —9. While our optically selected cata- 
log undoubtedly misses some very young, dust- 
enshrouded clusters, we estimate it to be complete 
at the > 85% level for clusters with r < lO'^ yr, 
based on positional coincidences between radio- 
continuum and optically selected sources (Whit- 
more &: Zhang 2002), and between infrared and 
optically selected sources (Whitmore et al. 2009). 

We have performed a variety of tests to as- 
sess the reliability of our age estimates. These 
include repeating the entire analysis using both 
the Bruzual & Chariot (2003) and the Starburst99 
(Leitherer et al. 1999) stellar population models, 
including and excluding the Ha measurements, 
and assuming a Galactic-type extinction law (Fitz- 
patrick 1999) and the starburst absorption curves 
of Calzetti et al. (1994). Most reddening val- 
ues Eb-v for Antennae clusters lie between 0.0 
and 1.0, with a median value « 0.3 mag, very 
similar to that determined by Whitmore et al. 
(1999) from reddening-free Q parameters. The 
standard error in logr for an individual cluster 
is ~ 0.3-0.4, corresponding to a factor of 2.0-2.5 
in T, determined from a comparison with spectro- 
scopic ages for > 20 Antennae clusters (FCW05; 
R. Chandar et al. 2009, in preparation). For most 
ages, the errors in logr are approximately sym- 
metric, causing little if any bias in the age dis- 
tribution. This is not true, however, for the very 
youngest clusters and those with ages in the range 
7.0 < log(r/yr) < 7.5, where the optical emis- 
sion from massive clusters is dominated by red 
supergiant (RSG) stars. During the RSG phase, 
the color tracks of the stellar population models 



loop back on themselves, and the fitted ages be- 
come degenerate, with a tendency to avoid values 
just above 10^ yr and a tendency to prefer slightly 
higher values. This well-known effect accounts for 
the relatively-empty vertical stripes in Figures 1 
and 2. As a result of this type of bias, there may 
be small-scale features in the observed age distri- 
bution that are not present in the real age distri- 
bution. We deal with this problem (in §3.3-3.5; 
see also FCW05) simply by binning on somewhat 
larger scales, A logr ~ 0.6-0.8, thus smoothing 
over any fine structure in the age distribution, 
whether real or artificial. Almost all studies of 
cluster age and mass distributions rely on broad- 
band optical measurements similar to those used 
here, and thus inherently face a similar problem. 
Future studies that include near-infrared photom- 
etry, which provides more reliable estimates of 
ages in the range 7.0 < log(T/yr) < 7.5, may be 
less affected by these difficulties. 

We now consider the uncertainties in our mass 
estimates. The random errors in the ages prop- 
agate into 1(7 uncertainties of « 0.3 in logM, 
or a factor of two in M. As we have already 
noted, the derived masses, but not the ages, of 
the clusters depend on the IMF in the stellar pop- 
ulation models. We have adopted the Salpeter 
(1955) IMF, mainly for ease of comparison with 
our earlier studies and those of others. If we had 
adopted the more modern Kroupa (2001) IMF or 
Chabrier (2003) IMF, which flatten below 1 Mq, 
all the masses in this paper would be reduced by 
a nearly constant (age-independent) offset of 40% 
(shown graphically in CFW09). Similarly, adopt- 
ing a shorter (longer) distance to the Antennae 
would systematically reduce (increase) the cluster 
masses, since they are derived from luminosities. 
It is important to note that none of these system- 
atic uncertainties affect the ratios of cluster masses 
or the shape of the mass fmiction presented in §3.2, 
and hence they do not affect the primary results 
of this paper. 

3.2. Mass and Luminosity Functions 

Figures 1 and 2 contain much of the statistical 
information about the population of clusters in the 
Antennae, embodied in the bivariate mass-age dis- 
tribution g{M,T) and its various projections, in- 
cluding the univariate mass, age, and luminosity 
distributions. In FCW05, we presented the age 
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distribution from this data set, and in this sec- 
tion, we present new determinations of the mass 
and luminosity functions. 

The mass function can be obtained by project- 
ing clusters diagonally, along the stellar popula- 
tion tracks in Figure 1, or cquivalcntly, horizon- 
tally in Figure 2. Figure 3 shows the mass func- 
tion for Antennae clusters in the two age intervals 
10^ yr < r < lO'^ yr and lO'^ yr < r < 10^ yr, 
which were chosen in part to minimize the influ- 
ence of errors in the estimated ages during the 
RSG phase mentioned above. We adopt a bin 
width of 0.4 in log M as a compromise between 
uncertainties in the mass estimates and adequate 
sampling of the mass function. We have checked 
that our mass function is not sensitive to the spe- 
cific (constant or variable) widths or locations of 
the bins, provided that they are at least ssO.2 wide 
in log M . The plotted range of masses for each age 
interval was chosen to lie above the stellar con- 
tamination limit, except for the lowest mass bins, 
which contain some objects fainter than this limit 
(fewer than 50%). We find that over this mass- 
age domain, the mass function declines mono- 
tonically with no obvious breaks or bends. It 
can be represented nicely by a pure power law, 
dN/dM oc M^, with (3 = -2.14 ±0.03 for clusters 
with 10^ yr < r < lO'^ yr and M > 10^ Mq, and 
P = -2.03 ± 0.07 for clusters with 10^ yr < r < 
10^ yr and M > 3 x 10* Mq, where the exponents 
and their la errors are based on least-square fits 
of the form log{dN/dM) = piogM + const. This 
power law extends at least up to lO^M© and pos- 
sibly up to 1O''M0. The true uncertainties in the 
exponents are larger than the quoted formal er- 
rors, probably A/3 w 0.2, based on our experience 
with different stellar population models, extinc- 
tion laws, and bin sizes. 

The mass function of the Antennae clusters pre- 
sented here is in excellent agreement with the ear- 
lier one from ZF99, based on the same WFPC2 ob- 
servations, and with a more recent one from Whit- 
more et al. (2009), based on new observations 
with the ACS on HST. However, it differs from the 
mass functions for the Antennae clusters presented 
by Fritze-von Alvensleben (1999) and Mengel et 
al. (2005), both of which exhibit substantial de- 
viations from power laws. Fritze-von Alvensleben 
found a lognormal-like mass function with a peak 
or turnover at ~ 10^ Mq. This feature, however. 



was close to the limit of the WF/PCl observations 
(with spherical aberration), and was subsequently 
shown to be an artifact of incompleteness by the 
deeper WFPC2 observations (ZF99). Mengel et 
al. (2005) found a mass function with a gradual 
bend at ~ 10^ Mq. However, this is based on a 
sample of clusters with a wide range of ages and 
a constant luminosity limit, rather than one that 
tracks the fading of the clusters (as in the ZF99 
and present studies). The Mengel et al. selection 
criteria therefore include younger low-mass clus- 
ters, but artificially exclude older low-mass clus- 
ters, those that have faded below the fixed lumi- 
nosity limit. The bend in the mass function then 
reflects this underrepresentation of low- mass clus- 
ters in the sample, not any physical process involv- 
ing the formation or disruption of the clusters. We 
demonstrate this explicitly in Appendix A, where 
we construct mass functions from our own sam- 
ple with selection criteria like those adopted by 
Mengel et al. 

It is also instructive to determine the luminos- 
ity function of the Antennae clusters and its re- 
lationship to the mass function. While the mass 
function is usually of greater interest in dynami- 
cal studies, the luminosity function can be derived 
from observations in fewer bands and is available 
for the cluster systems in more galaxies. Figure 4 
shows our determinations of the luminosity func- 
tion of the Antennae clusters, with and without 
corrections for extinction. These can be repre- 
sented by pure power laws, dN/dL oc L", with 
a = —2.09 ± 0.05 (corrected for extinction) and 
a = —2.37 ± 0.06 (not corrected for extinction), 
for Ly > 3 X lO^L©. While there may be minor 
deviations from these power laws, such as mild 
curvature, at the level Aa < 0.3, the reality of 
such features is doubtful, given that the true un- 
certainties on a are nearly as large. We conclude 
that the extinction-corrected luminosity function 
has the same or nearly the same power-law shape 
as the mass function, i.e., a w /3 « —2. 

Before we discuss the implications of this result, 
we note a contrary claim by Anders et al. (2007) . 
They find a peak in the y-band luminosity func- 
tion of the Antennae clusters near My ~ —8.5. 
This is based on a sample that excludes all clus- 
ters with uncertainties in the U-h&nd magnitudes 
greater than 0.2. However, these clusters certainly 
exist and should be included in any unbiased de- 
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termination of the luminosity function, as they are 
in the results presented here. When we apply the 
Anders et al. selection criteria to our own sam- 
ple, wc reproduce their luminosity function, as dis- 
cussed further in Appendix A. Thus, we conclude 
that the peak they have found is an artifact of 
their selection criteria. 

In a previous study of the Antennae clusters, we 
found some marginal evidence for a weak bend in 
the luminosity function near My ~ —10.4 (Whit- 
more et al. 1999). This was based on luminosi- 
ties without corrections for extinction. There is 
a slight hint of mild curvature in the uncorrected 
and corrected luminosity functions derived here, 
but we are skeptical of this feature because it ap- 
pears at about the same level as the true imccr- 
tainties in the exponent, namely Aa « 0.2. A 
recent analysis of deeper observations taken with 
the ACS on HST confirms the results presented 
here that the luminosity function of the Anten- 
nae clusters is well represented by a pure power 
law with a -2 (Whitmore et al. 2009). More- 
over, the luminosity functions of the star clusters 
in many other galaxies can also be approximated 
by power laws with a « — 2 (Larsen 2002). 

While the luminosity function is sometimes re- 
garded as a proxy for the mass function, this need 
not be true for clusters with a wide range of ages 
and hence mass-to-light ratios. For example, a 
(hypothetical) population of star clusters that all 
form with the same mass at a constant rate and 
experience no disruption will have a power-law lu- 
minosity function with a « — 2, very different from 
the assumed underlying delta-function mass func- 
tion (Fall 2006). Thus, the similarity of (t>{L) and 
il){M) we have found for the Antennae clusters 
should not be taken for granted. Instead, it must 
reflect a basic property of the mass-age distribu- 
tion g{M,T). In the special case that this distri- 
bution is separable, g{M, r) oc xIj{M)x{t)^ and the 
mass function is a power law, ip{M) cx , the age 
variable can be integrated out and the luminos- 
ity function will also be a power law, (t>(L) oc i", 
with the same exponent, a = (3 (Fall 2006). This is 
true regardless of the specific form of x(j)- There- 
fore, the fact that the observed mass and luminos- 
ity functions of the Antennae clusters are similar 
power laws is indirect support for the decomposi- 
tion of g(M, t) given in equation (1). Indeed, the 
assumption that masses and ages are independent 



of one another is the basis of Model 3, which, as 
we show in §3.5, provides a much better fit to the 
observations than Models 1 and 2. 

3.3. Comparison with Model 1: Sudden 
Mass-Dependent Disruption 

In this and the next two subsections, we com- 
pare predictions from the three disruption models 
developed in §2 with observations of the Anten- 
nae clusters. The simplest way to do this is in 
terms of the averages of .g(M, r) over several adja- 
cent intervals of mass and age. These averages or 
projections are given by the expressions 

g{M) = ^ [ ' g{M, T)dT, (14) 

An alternative approach would be to compare the 
models and observations in the M-r plane, using 
standard statistical methods (e.g., minimum or 
maximum likelihood) to derive a single goodncss- 
of-fit measure. This however, has the disadvan- 
tage that any systematic errors, like those from 
estimating ages in the RSG phase as discussed in 
§3.1, cause artificial non-random deviations from 
the models, for which it is not possible to assign 
meaningful values. Moreover, it turns out to 
be fairly obvious from g{M) and g(r) which mod- 
els match the observations and which do not. We 
present analytical formulae for g{M) and ^(r) in 
Appendix B for the three models discussed in §2. 
These are our main tools for interpreting the ob- 
servations in terms of formation and disruption 
processes. 

We first consider Model 1, which assumes 
that clusters are disrupted suddenly at a mass- 
dependent age Td{M) = r*(M/M*)'=. It is in- 
structive first to display the predictions of this 
model in the M-t plane. Figure 5 shows a Monte 
Carlo realization of a cluster population generated 
by Model 1 with /? = -2, r* = 5 x lO'^ yr, and 
k — 0.62. It has been suggested that this value of 
k may be ubiquitous among the cluster systems 
of different galaxies (Boutloukos & Lamers 2003). 
In this model, lower mass clusters are destroyed 
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sooner than higher mass clusters, resulting in a 
sharp, diagonal line beyond which no clusters sur- 
vive. The index k regulates the slope of this line 
(proportional to 1/fc), while t, controls its inter- 
cept. Model 1 predicts an evacuated, triangular 
region to the right of the disruption line (marked 
D in Fig. 5). However, no such feature appears in 
Figure 2, the observed M-t plane for the Anten- 
nae clusters. Thus, we can immediately rule out 
Model 1, at least for the mass-age domain studied 
here. 

Despite this failure of Model 1, it is still in- 
structive to compare the predicted and observed 
g{M) and ^(r) distributions as a prelude to tests 
of the more realistic Models 2 and 3. Figures 6 
and 7 show these comparisons for the same param- 
eters adopted in Figure 5. Evidently, the shapes 
of the predicted and observed g{M) distributions 
match for sufBciently large r, (i.e. r* > 5 x lO'' yr), 
ensuring that the predicted curvature occurs be- 
low the observed mass range. However, the am- 
plitudes cannot be matched; the predicted ampli- 
tude is constant with age because, in the model, 
there is no evolution until clusters are suddenly 
disrupted, whereas the observed amplitude de- 
creases substantially with age. The opposite prob- 
lem arises with the ^(t) distribution. In this case, 
the predicted and observed amplitudes agree (a di- 
rect consequence of the adopted initial mass func- 
tion, ^po{M) (X M~^), but the shapes are quite dif- 
ferent; the observations decline approximately like 
a power law in age, while the predictions remain 
flat for a while and then drop rapidly near the dis- 
ruption time Td for clusters with the relevant mass 
(Ml < M < M2). This behavior is generic: ^(r) is 
predicted to have a sharp bend at Td, but none is 
observed. Thus, we confirm our earlier conclusion 
that Model 1 does not describe the population of 
massive young clusters in the Antennae. 

3.4. Comparison with Model 2: Gradual 
Mass-Dependent Disruption 

Model 2 retains the power-law dependence of 
the disruption timcscalc on mass but smooths 
out the discontinuous evolution postulated by 
Model 1. As a result of this smoothness, one might 
guess that Model 2 would fare better against the 
observations than Model 1, but this turns out not 
to be true, as we now demonstrate. In Figures 8, 
9, and 10, we compare the predicted g{M) from 



Model 2 for fc = 0, 0.6, and 1.0, respectively, with 
our Antennae observations. In the limit k = 0, 
the disruption rate is independent of mass. In this 
case (Fig. 8), the predicted and observed g{M) 
match in shape for all values of r* but in ampli- 
tude only for r* « 1.4 x lO'' yr. However, with 
this value of (or any other), the predicted gi^r) 
fails to reproduce the observed one, as shown in 
Figure 11. The reason for this is that, with the 
disruption law postulated by Model 2 (eqs. [8] 
and [9]), there is always a bend near in M(t) 
(eq. [10]) and hence in ^(t) (eqs. [B6] and [B7]), 
whereas the observations show no such feature. In 
the mass-dependent cases, k = 0.6 (Fig. 9) and 
k = 1.0 (Fig. 10), the predicted g{M) matches the 
observed one in shape for large enough t* , so that 
the predicted curvature occurs below the observed 
mass range, but it never matches in amplitude 
for clusters both younger and older than 10^ yr. 
These are essentially the same problems that af- 
flicted Model 1, with relatively minor differences 
resulting from the different disruption laws (one 
sudden, one gradual). We conclude, therefore, 
that Model 2 also fails to describe the Antennae 
clusters in the mass-age domain studied here. 

This leads us to a more general consideration 
of mass-dependent disruption. We have shown 
that the observed mass function of star clusters 
in the Antennae has the same shape but different 
amplitude for different ages, while the observed 
age distribution has the same shape but different 
amplitude for different masses. These facts can- 
not be explained by mass-dependent disruption. 
In particular, if low-mass clusters were disrupted 
faster than high-mass clusters, the mass function 
would become flatter with increasing age; and con- 
versely, if disruption had the opposite dependence 
on mass, the mass function would become steeper. 
Neither effect is observed in the mass function of 
the Antennae clusters. Furthermore, there arc no 
discernible features in the observed mass and age 
distributions, as would be expected if the rate of 
disruption changed at a characteristic mass or age. 
We emphasize that these conclusions are not af- 
fected by the formation history of the clusters (i.e., 
whether they form at a constant or variable rate), 
provided only that the shape of their initial mass 
function is invariant. In this case, variations in 
the formation rate would affect only the ampli- 
tude of the mass function, but not its shape, at 
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different ages0 Thus, we conclude, quite gener- 
ally, that disruption of the massive young clusters 
in the Antennae has little or no dependence on 
mass. 

3.5. Comparison with Model 3: Gradual 
Mass-Independent Disruption 

Model 3, with mass-mdependent disruption, 
was designed to remedy the problems encountered 
in fitting Models 1 and 2 to the Antennae data. 
As we noted in Section 2, the disruption of the 
population of clusters is gradual in this model, al- 
though the disruption of individual clusters may 
be gradual or sudden. This model also has fewer 
adjustable parameters (C, /?, 7) than the other 
models (C, /3, k, r,). In Model 3, the averages 
over g{M,T) are pure power laws: g{M) oc M^^ 
and ^(t) oc t"*". As Figures 12 and 13 show, 
the predicted and observed g{M) and ^(t) agree 
nicely, both in shape and amplitude, for (3 = —2.0 
and 7 = —1.0. These results corroborate those 
from our earlier studies of the Antennae clusters 
(ZF99, FCW05, WCF07). Model 3 provides an 
excellent description of the population of massive 
young clusters in the Antennae. We note that the 
domain of validity of this result — the region of the 
M-T plane above the stellar contamination limit — 
covers a relatively wide range of masses for very 
young clusters but becomes progressively narrower 
with increasing age, as shown in Figure 2 and 
approximated by equation (2). Thus, we regard 
Model 3 as a robust description of the observations 
for T < 10^ yr and more akin to an extrapolation 
for T > 10^ yr. 

4. INTERPRETATION AND IMPLICA- 
TIONS 

We showed in the previous section that the 
joint distribution of masses and ages of massive 
young clusters in the Antennae can be approxi- 
mated by the simple formula g{M,T) oc M~^t~^. 
This means that the shape of the mass function 
tp{M) oc is independent of age and that the 

shape of the age distribution x(r) oc is inde- 
pendent of mass (at least over the observed do- 



^ The fact that the shape of the mass function is not al- 
tered by a variable rate of cluster formation also follows 
from equations (3) and (4), where C(r) appears only as a 
prefactor in g(M, r). 



main of M and r). We now seek to understand 
these results in terms of the physical processes that 
operated during the formation and early evolution 
of the clusters. We also consider the long-term fu- 
ture evolution of the young clusters and whether 
their mass function will eventually resemble that 
of old globular clusters. The picture presented 
here is an extension of the one proposed by Har- 
ris & Pudritz (1994) and Elmegreen & Efremov 
(1997) and developed in some of our earlier pa- 
pers (ZF99; Fall & Zhang 2001; FCW05). 

4.1. Formation of Star Clusters from Molec- 
ular Clouds 

The mass functions of young star clusters must 
depend in some way on the mass functions of the 
molecular clouds in which they formed. Blitz et al. 
(2007) review CO observations of molecular clouds 
in several galaxies in the Local Group: the Milky 
Way, LMC, SMC, M31, M33, and ICIO. The 
mass functions of the clouds can be represented 
by power laws, dN/dM oc M^^, with (3 « -1.7 
in most cases. The exceptions are M33, which 
has /3 « —2.5, and the SMC, which has too few 
clouds to estimate (3. The range of masses cov- 
ered by these observations varies substantially, 
from a few x 10^ Mq < M < 10^ Mq for M33 to 
10^ Mq < M < few X 10^ Mq for the Milky Way. 
For complexes of molecular clouds in the Anten- 
nae, Wilson et al. (2003) find /3 « -1.4 (for 
Af > 10^ Mq). While these estimates of /? are 
suggestive, it is worth noting that they pertain to 
clouds with relatively low column density thresh- 
olds (compared with the regions of most intense 
star formation) and, in the case of the Antennae, 
may also suffer from blending as a result of the 
larger distance and hence lower spatial resolution. 

It is also interesting at this stage to recall some 
results for protoclusters in other galaxies. Studies 
of HII regions and OB associations in the Milky 
Way and many nearby galaxies show that their 
Ha and Lyc luminosity functions have power-law 
form, dN/dL oc L", with exponents near a = —2 
(Kennicutt et al. 1989; McKee & Wilhams 1997). 
There appear to be small differences among the 
exponents for different galaxies and in some cases 
a high-luminosity turnover or cutoff. As we have 
noted before, luminosity and mass functions in 
general are not the same, due to the spread in 
ages and fading of the clusters. However, for HII 
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regions and OB associations, the age spread is rel- 
atively narrow (r < 10^ yr), the M/L variations 
are modest, and luminosity functions become ef- 
fective proxies for mass functions. Dowell et al. 
(2008) have derived the mass functions of young 
clusters (r < 2 x lO'' yr) in several nearby galax- 
ies based on broadband multi-color photometry 
(by a method generally similar to ours but us- 
ing ground-based rather than HST data). For 
M > few X lO^M©, they find dN/dM oc M^, with 
(3 = —1.9 for their combined sample of irregular 
galaxies and (3 — —1.8 for their combined sample 
of spiral galaxies. We find (3 ~ —1.8 for the young 
clusters in the LMC (CFW09). 

It is remarkable that all these mass functions 
are power laws with nearly the same exponent, 
(3 ~ —2. There may be small but real differences 
between the mass functions of molecular clouds 
and young star clusters in general and/or between 
the mass functions of either clouds or clusters in 
different galaxies, at the level A/3 « 0.2-0.6. The 
significance of these differences is low, typically 1- 
3 cr, based on the statistical and likely systematic 
uncertainties in (3. We also note that the differ- 
ent mass functions were determined by different 
researchers using different methods and that they 
pertain to different density thresholds and mass 
ranges. Thus it is possible that they are all basi- 
cally the same, or more conservatively, that any 
differences between them are barely detectable. 
This means that the average efficiency of star for- 
mation in protoclusters — the ratio of the final stel- 
lar mass to the initial interstellar mass — is approx- 
imately independent of the mass of the protoclus- 
tersH If the efficiency varies at all, it decreases 
with increasing mass, since the mass functions of 
molecular clouds appear to be slightly flatter than 
the mass functions of young clusters. On aver- 
age, low-mass protoclusters make stars just about 
as efficiently as high-mass protoclusters. This is 
a simple but important empirical result, which 
should help to guide theories of star and clus- 
ter formation. Indeed, it probably reflects a bal- 
ance between star formation and stellar feedback 
within protoclusters (Elmegreen & Efremov 1997; 
Krumholz et al. 2006). 



^In principle, the relation between the mass functions of 
the clouds and clusters also depends on the lifetimes of the 
clouds, although these are expected to have at most a weak 
dependence on mass (Matzner & McKee 2000). 



A complete understanding of the mass func- 
tions of young star clusters clearly requires an un- 
derstanding of the mass functions of their progen- 
itor molecular clouds. The ISM is often described 
as a turbulent medium in which the density and 
velocity fluctuations on each spatial scale mimic 
those on other scales in a self-similar hierarchy (see 
the reviews by Elmegreen & Scalo 2004 and Mc- 
Kee & Ostriker 2007). Given this picture, it is 
not surprising that the mass function of molecu- 
lar clouds is a power law. The important question 
is what physical processes determine the exponent 
(3. Fleck (1996) has derived (3 = —2 for relatively 
diffuse, non-self-gravitating clouds based on a sim- 
ple analytical model of compressible turbulence in 
the ISM. Wada et al. (2000) flnd /3 = -1.7 in 
their 2D hydrodynamic simulations of ISM turbu- 
lence without stellar feedback and slightly steeper 
mass functions when feedback is included (see also 
Vazquez-Semadeni et al. 1997). Elmegreen (2002) 
finds (3 K, —2 in his static models of a fractal ISM, 
and slightly steeper (shallower) mass functions for 
clouds with higher (lower) density thresholds. De- 
spite these results, it is not yet entirely clear why 
the mass function of real molecular clouds has 
(3 K. —2 rather than /3 w — 3 or /3 ss — 1 (say). 
Explaining the physical origin of this exponent re- 
mains an important theoretical challenge. 

4.2. Early Disruption: Processes and 
Timescales 

Interstellar material is removed from protoclus- 
ters by feedback from massive stars (photoion- 
ization, radiation pressure, stellar winds, and su- 
pernovae). Observationally, the dividing line be- 
tween protoclusters that contain interstellar ma- 
terial and those that do not is roughly r ^ 10^ yr, 
based on the coincidence of molecular clouds and 
HII regions with the locations of clusters of differ- 
ent ages (Blitz et al. 2007). This timescale also 
makes sense from a physical point of view; it is 
a few multiples of the lifetimes of the most mas- 
sive stars and about an order of magnitude longer 
than the dynamical or crossing times within the 
clusters. An important consequence of this early, 
rapid mass loss is that many of the protoclusters 
cannot remain gravitationally bound and there- 
fore begin to dissolve almost as soon as they form 
(Hills 1980), a process often called "infant mor- 
tality." In reality, the removal of ISM by stel- 
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lar feedback will unbind some protoclustcrs com- 
pletely and others only partially, leaving behind 
less-massive, but bound remnants. This process 
obviously affects the age distribution and licncc 
the amplitude of the mass function of young clus- 
ters. But does it also affect the shape of the mass 
function? 

It has been proposed that ISM removal disrupts 
a higher proportion of low-mass clusters than high- 
mass clusters, and that this introduces a bend 
or flattening in the mass function at ~ lO^M© 
(Kroupa & Boily 2002; Baumgardt et al. 2008; 
Parmentier ct al. 2008). We believe these claims 
are based on unrealistic models of protoclusters 
(see below). However, it is an easy matter to check 
observationally whether or not they are correct, by 
comparing the mass functions of Antennae clus- 
ters in the two age bins divided at r = lO'' yr 
(Figure 3). These show no hint of a bend; the 
power-law form of the mass function in the older 
bin appears identical to that in the younger bin, 
as we have already noted. If infant mortality im- 
prints any feature in the mass function at all, it 
must be well below IO^Mq and probably below 
IO^Mq (to avoid any hint of curvature). We ob- 
tain even stronger constraints on mass-dependent 
disruption in our recent study of the clusters in 
the LMC (CFW09). 

The shape of the mass function of young clus- 
ters depends on how effectively stellar feedback 
removes the ISM from protoclusters of different 
masses. This in turn depends on the fraction of the 
input energy that is radiated away from the pro- 
toclusters as a function of their mass. Most of the 
internal kinetic energy of molecular clouds is in the 
form of supersonic turbulence, which is dissipated 
in strong shocks on the dynamical timescales of 
the clouds (McKec & Ostriker 2007 and references 
therein). This important energy sink is not in- 
cluded in the models proposed by Kroupa & Boily 
(2002), Baumgardt et al. (2008), and Parmentier 
et al. (2008). Thus, it is Hkely that they have 
overestimated the energy available for disrupting 
protoclusters, possibly by large factors. In the 
limit of no radiative losses, the feedback is energy- 
driven, whereas in the opposite limit of maximum 
losses, it is momentum-driven. The second regime 
is likely to be more realistic than the first, and 
further analysis is needed to determine whether 
such feedback would imprint any features on the 



mass function of young clusters. Preliminary in- 
dications are that it would not (S. M. Fall et al. 
2009, in preparation). 

The preceding discussion of infant mortality 
presumes that the molecular clouds in which star 
clusters formed were gravitationally bound for at 
least some time before the onset of stellar feed- 
back. This picture, however, is now debated (see 
the reviews by Elmegreen & Scalo 2004 and Mc- 
Kee & Ostriker 2007). It is possible instead that 
some of the sites of star and cluster formation are 
merely transient molecular concentrations created 
by convergent flows in the interstellar turbulence. 
The protoclusters produced at these sites might be 
unbound from the beginning, irrespective of any 
stellar feedback that occurs within them. This 
cannot be the whole story, of course, because at 
least a small fraction of clusters — those that sur- 
vive for many of their own crossing times — must 
have formed as gravitationally bound objects. The 
revised picture in which some protoclusters are ini- 
tially bound while others are unbound has not yet 
been studied in enough detail to make firm predic- 
tions about the shape of the resulting mass func- 
tion of the surviving clusters. In a scale-free (frac- 
tal) ISM, however, there is no immediate reason to 
suspect that the fractions of bound and unbound 
clusters would change abruptly at a preferred mass 
scale or would even depend on mass at all. 

The young clusters will continue to lose mass 
through stellar winds and other ejecta even af- 
ter they have expelled all their nascent interstel- 
lar material. Stellar evolution alone depletes the 
masses of star clusters by ~40% over a period of 
a few X 10*^ yr, although most of this mass is lost 
in the first few x lO*" yr, the exact fraction and 
timcscale depending on the shape of the stellar 
IMF. This process can unbind clusters in a tidal 
field if they arc already weakly bound, with con- 
centration parameters [c = \og{rt/rc)] in the range 
c < 0.7 (corresponding to dimcnsionless central 
potentials in the range Wq < 3; Chernoff & Wein- 
berg 1990; Fukushige & Heggie 1995; Takahashi & 
Portegies Zwart 2000). As a result of the prior re- 
moval of interstellar material, we expect many of 
the surviving clusters to be only weakly bound and 
thus vulnerable to disruption by subsequent stellar 
mass loss. If the concentration parameters of the 
clusters are uncorrelated with their masses (the 
simplest possibility), a large fraction of them could 
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be disrupted in the period 10^ yr ^ t ^ 10* yr, 
without altering the power-law shape of the mass 
function, consistent with our observations of the 
Antennae and LMC chisters. In contrast, Ves- 
perini & Zepf (2003) have argued that this pro- 
cess would produce a "bell-shaped" mass function. 
They base this claim on an assumed strong corre- 
lation between concentration and mass for young 
clusters, a hypothesis for which there is no physi- 
cal explanation or observational evidence. The old 
globular clusters in the Milky Way do have such 
a correlation, but this is almost certainly a prod- 
uct of internal dynamical evolution over a Hubble 
time, including core collapse, tidal heating, and 
so forth. In any case, the similar shapes of the 
mass functions of clusters younger and older than 
10^ yr in both the Antennae and the LMC tell 
heavily against this suggestion. 

Another mechanism for disrupting star clus- 
ters is tidal disturbances by passing molecular 
clouds (Spitzer 1958). We follow the comprehen- 
sive treatment by Binney & Tremaine (2008, Sec 
8.2) and distinguish two regimes: catastrophic, in 
which clusters are disrupted suddenly by a sin- 
gle strong encounter, and diffusive, in which clus- 
ters are disrupted gradually by a sequence of weak 
encounters. The disruption timescales in these 
regimes (up to proportionality) are 

1/2 

td oc — (catastrophic regime), 

MpTlp 

(16) 

td OC — ^ — (diffusive regime). 

MpTlp 

(17) 

Here, ph = 3M/(87rr^) is the mean density within 
the half-mass radius rt of the clusters, Mp, rhp, 
and Hp are the mass, half-mass radius, and mean 
number density of the molecular clouds (the per- 
turbers), respectively, and Urei is the dispersion 
of the relative velocities between the clusters and 
clouds. Equations (16) and (17) display two im- 
portant consequences of tidal disruption: (1) In 
both regimes, td depends on the masses M of 
the clusters only through their mean internal den- 
sities ph; (2) In the catastrophic regime, td de- 
pends on the properties of the molecular clouds 
only through their mean smoothed-out density, 
Pp = MpTip. For open clusters in the solar neigh- 
borhood, Binney & Tremaine (2008) estimate td ~ 



3 X 10* yr (in the catastrophic regime). This might 
provide a rough indication of what to expect in 
the Antennae galaxies, since ph and pp might also 
be similar to their local values. If so, tidal dis- 
turbances would be effective in disrupting clusters 
older than ~ 10* yr in the Antennae. Despite the 
large uncertainty in the timescale for this process, 
we do not expect it to alter the shape of the mass 
function, since the mean internal densities of the 
clusters are fixed mainly by the galactic tidal field 
and should therefore be approximately the same 
for clusters of different masses. 

We argued in the Introduction that the age dis- 
tribution x(r) primarily reflects the disruption his- 
tory rather than the formation history of the clus- 
ters. The disruption history in turn is likely dom- 
inated by different physical processes in different 
intervals of age. Based on our previous discus- 
sion, the following sequence seems plausible: (1) 
disruption by removal of ISM, r < 10'' yr; (2) dis- 
ruption by stellar mass loss, 10''yr < r < 10*yr; 
(3) disruption by tidal disturbances, r > 10* yr. 
It also seems likely that the rate of disruption and 
hence the shape of the age distribution varies from 
one of these regimes to another, with some sort of 
features at the transitions between them. How- 
ever, this combination of disruption processes in- 
volves too many unknowns to predict how promi- 
nent such features would be. As we have already 
noted, the uncertainties in the ages of the clus- 
ters preclude the measurement of fine structure 
in the age distribution, especially near the likely 
transition between disruption dominated by ISM 
removal and by stellar mass loss, at r ~ 10'' yr. 
Moreover, the tail of x{t) potentially dominated 
by tidal disturbances, at t > 10* yr, is based on 
observations near the limit of our sample, where 
the accessible range of masses is relatively small. 
Plausible variations in the formation rate could 
also make the age distribution slightly flatter or 
slightly steeper. Thus, it is likely that our power- 
law model, xi'T') t~^, is a simple approximation 
to a complex situation involving several different 
physical processes rather than an exact descrip- 
tion of a single process. In any case, since the 
shape of the mass function is preserved by each of 
these processes, it will also be preserved by any 
combination of them. 
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4.3. Late Disruption of Clusters: Evapo- 
ration 

As we already noted, a primary motivation for 
the present study has been to understand the 
formation and early evolution of star clusters in 
an environment similar to that during the early 
phases of the hierarchical assembly of galaxies, 
when the clusters that are now classified as old 
globular clusters were originally produced. Thus, 
we are led to speculate on the long-term future 
evolution of the young clusters in the Antennae, 
those that survive the first ~ 10^ yr, over the 
next ~ 10^° yr or so. A major issue here is that 
the mass functions of the young and old clusters 
are very different, and this immediately raises the 
question whether the former would evolve by long- 
term disruptive processes into the latter. 

The upper panel of Figure 14 shows the mass 
functions of the young clusters in the Antennae 
and the LMC (both samples restricted to r < 
10® yr). Here, we have plotted dN/d\ogM rather 
than dN / dM , as in our previous figures. The mass 
function for the Antennae clusters is taken from 
the present paper, while that for the LMC clus- 
ters is taken from our companion paper (CFW09), 
with a vertical shift to allow for the different sizes 
of the two populations. Both functions have the 
same power-law shape, although they cover differ- 
ent ranges of mass (higher in the Antennae, lower 
in the LMC). The upper smooth curves in both 
panels of Figure 14 are the same Schechter func- 
tion, i)o{M) oc M''exp(-M/Mc), with the pa- 
rameters (3 = -2 and = 2 x IQ^Mq. We 
have appended the subscript to to indicate 
that we will adopt this as the "initial" mass func- 
tion in the calculation that follows. Evidently, we 
can represent the mass functions of young clusters 
over the observed range of masses equally well by 
a pure power law or a Schechter function with a 
sufficiently large cutoff Mc- In the present con- 
text, the Schechter function is preferable because 
it provides a better fit to the high-mass end of the 
mass function of old globular clusters, shown in 
the lower panel of Figure 14 (see also Burkert & 
Smith 2000; Jordan et al. 2007). 

The most important long-term disruptive pro- 
cess operating on clusters is the gradual escape 
of stars driven by internal two-body relaxation 
("evaporation"). This process depletes the mass 



of each cluster approximately linearly with time, 
M{t) K, Mq — Hevt, at a rate fiev that depends 
primarily on the mean internal volume or surface 
density of the cluster (/ie„ oc p]^^ with pt oc M /r^ 
for standard evaporation, p^v S^^^ with oc 
Mjrl for retarded evaporation; see Fig. 6 and 
eq. (12) of Baumgardt & Makino 2003 for the lin- 
earity of M{t) and McLaughlin & Fall 2008 for the 
dependence of pev on pt and Ef). The mass func- 
tion of clusters with the same internal density at 
an age r is then related to the initial mass function 
by the simple formula 

V'(M,r) « Vo(M + Aie.r) (18) 

oc (A/ + p^^Tf exp[-(M + /ie^,T)/Mc] 

(FaU & Zhang 2001; Jordan et al. 2007). For a 
population of clusters with different internal den- 
sities, and hence evaporation rates, the mass func- 
tion is a superposition of terms like the one above, 
each with its own value of /^eu- Here, we approx- 
imate this superposition by a single term with 
a typical value of /ig^ for the whole population, 
which we have checked is sufficiently accurate for 
our present, illustrative purposes (see McLaugh- 
lin & Fall 2008 for a comprehensive discussion of 
these issues). 

Two other long-term disruptive processes that 
are sometimes invoked are gravitational shocks 
(during passages through a galactic disk or near 
a galactic bulge) and dynamical friction. Gravita- 
tional shocks may be important for some massive 
clusters, but they are negligible compared with 
evaporation for low-mass clusters, in particular, 
those with masses near and below the turnover 
in the mass function i.e., M < 10^ Mq (Fall & 
Zhang 2001). Furthermore, gravitational shocks 
alone cannot change the shape of the mass func- 
tionlfl Dynamical friction is only important in a 
massive galaxy, such as the eventual remnant of 
the merging Antennae, for a few very massive clus- 
ters located very near the galactic center and can 
safely be neglected for all the other clusters in a 
first approximation (Fall & Zhang 2001; Binney & 
Tremaine 2008). Tidal encounters with molecular 
clouds can also be neglected once the clusters are 
dispersed out of the disks of the merging galax- 



In the notation of §2, with the disruption timescale written 
in the form tj^{M) qc M^, evaporation corresponds to fc = 1 
and shocks to fe = 0. 
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ies and into the spheroid or halo of the remnant 
galaxy (at r ~ 10^ yr or earlier). 

The lower panel of Figure 14 shows the mass 
functions of the old globular clusters in the Milky 
Way and the Sombrero galaxy, again plotted 
as dN/d\ogM. Both mass functions are de- 
rived from the corresponding luminosity functions, 
based on data from Harris (1996, as updated 
at http: / /www.mcmaster.ca/~harris/mwgc.dat) 
and Spitler et al. (2006), respectively, with an as- 
sumed mass-to-light ratio M/Ly = 1.5Mq/ Lq, 
a typical value obtained in dynamical studies 
(McLaughlin 2000). Again, the mass functions 
were shifted vertically to allow for the differ- 
ent sizes of the populations; and again both 
functions have similar shapes, although they 
cover somewhat different ranges of mass. With 
increasing mass, these functions first rise ap- 
proximately linearly {dN/d\ogM oc M, corre- 
sponding to dN/dM K, const), reach a peak at 
Mp w (1 - 2) X 10^ Mq, and then decline steeply. 
The five solid curves in each panel of Figure 14 are 
the predictions of the simple evaporation model, 
equation (18) with /ie„ ~ 2 x IQ^^Mq yr~^ and 
T = 0, 1.5, 3, 6, and 12 GyrQ This value of 
jjLev corresponds to an escape rate of 5%-10% per 
relaxation time, as indicated by various Fokker- 
Planck, Monte-Carlo, and A^-body simulations of 
the evaporation of tidally limited clusters, when 
combined with the typical observed internal den- 
sities of the globular clusters in the Milky Way 
and Sombrero galaxies (Chandar et al. 2007; 
McLaughhn & FaU 2008). Thus, the horizontal 
position of the models in Figure 14 is fixed to 
within ±0.2 or less in log Af. The vertical position 
of the models is fixed by the high- mass end of the 
observed mass function, where evolution can be 
neglected. 

Evidently, the model i^iM^r) at r = 12 Gyr 
coincides remarkably well with the observed mass 
function of old globular clusters over the full range 
of masses, from - lO^M© to above - IO^Mq. 
This is a very simple and gratifying result: start- 

^The peak mass of Mi/'(M) increases approximately linearly 
with age: Mp ^ fievT 2 X 10^ (T/10^°yr)MQ (valid for 
Mp <C Mc). The line specified by this equation lies well 
below the luminosity limit that defines our sample of An- 
tennae clusters in the mass-age plane, i.e., below the solid 
curve in Fig. 2. Thus, we would not expect to see any 
evidence for such a feature in the mass functions of these 
clusters, consistent with the power laws shown in Fig. 3. 



ing with a population of massive young clusters 
like those in the Antennae and the LMC, and then 
letting evaporation take its toll for about 12 Gyr, 
leads inevitably to a population of old globular 
clusters like those in the Milky Way and the Som- 
brero galaxy. No other ingredients or mechanisms 
are needed. This conclusion contrasts sharply with 
some recent suggestions that the turnover or bend 
at M ~ 10^ Mq in the evolved mass function of 
old globular clusters is simply a relic of a simi- 
lar feature in the mass function of young clusters 
(see, for example, Parmentier et al. 2008). There 
are two problems with this explanation: (1) The 
postulated bend in the "initial" mass function con- 
flicts directly with observations, as we have shown 
here for the Antennae clusters (and was shown ear- 
lier by ZF99) and in a companion paper for the 
LMC clusters (CFW09). (2) The claimed mech- 
anisms for producing a bend in the mass func- 
tions of young clusters are either physically unre- 
alistic (neglecting radiative dissipation of turbu- 
lence), or unjustified (assuming a strong correla- 
tion between mass and concentration). We note 
that the model with a power-law initial mass func- 
tion (i.e., without a bend) is both simpler and 
fully consistent with all relevant observations of 
old globular clusters, including the dependence of 
their evolved mass function on their internal den- 
sities and galactocentric positions (Chandar et al. 
2007; McLaughlin & Fall 2008)11 

5. SUMMARY AND CONCLUSIONS 

In this paper, we have derived analytical formu- 
lae for the bivariate mass-age distribution g{M, t) 
for three idealized models of the formation and 
disruption of star clusters. We have also de- 
rived formulae for the corresponding averages of 
g{M,T) over finite intervals of M and r, de- 
noted by ^(t) and g{M) respectively. In all three 
models considered here, clusters are assumed to 



The present orbital eccentricities of old globular clusters 
are sometimes invoked as a constraint on the shape of their 
initial mass function (Fall & Zhang 2001; Vesperini et al. 
2003; Baumgardt et al. 2008). This constraint, however, 
is extremely weak in practice, because it is based on the 
simplifying assumption that the galactic potential in which 
the clusters orbit is spherical and static, an assumption 
that is not consistent with the hierarchical formation and 
evolution of galaxies (see Fall & Zhang 2001; Chandar et 
al. 2007; McLaughlin & Fall 2008 for further discussion of 
this point). 
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form with a power-law initial mass function at 
a constant rate. In the first two models, pro- 
posed by Boutloukos & Lamers (2003), clusters 
are disrupted on a timescale that depends on 
their masses as Td{M) oc M'' with A; > 0, either 
suddenly (Model 1) or gradually (Model 2). In 
the third model, clusters arc disrupted (suddenly 
or gradually) at a fractional rate independent of 
their masses, as indicated by our earlier studies of 
the Antennae clusters (ZF99, FCW05, WCFG7). 
Model 3 has the remarkably simple bivariate dis- 
tribution g{M, t) oc M'^t''. The corresponding lu- 
minosity, mass, and age distributions are all pure 
power laws: (j){L) oc L", ip{M) oc M^, x(t) oc . 

We have compared Models 1, 2, and 3 with 
the empirical distributions of luminosities, masses, 
and ages for a large sample of star clusters in 
the Antennae galaxies. These are based on our 
UBVIHa photometry of point-like sources in im- 
ages taken with the WFPC2 on HST and compar- 
isons with stellar population models to estimate M 
and T for each soiircc (after correcting for inter- 
stellar reddening). To distinguish clusters of stars 
from individual stars, we have restricted our sam- 
ple for analysis to objects brighter than My = —9, 
corresponding to L > 3x lO^L©. This limit defines 
the domain of validity of the models in the L-t and 
M-T planes, the regions above the solid curves 
in Figures 1 and 2 respectively, corresponding ap- 
proximately to r < lO^(M/lO^M0)i-3 yr. Thus, 
all the results presented in this paper pertain to 
relatively massive and relatively young clusters 
(except in §4.3). 

We find that Model 3, with mass-mdependent 
disruption, provides a good match to the ob- 
served mass-age distribution of the Antennae clus- 
ters. The best-fitting exponents in this model are 
/3 « — 2 and 7 — 1. Models 1 and 2, with mass- 
dependent disruption fare much worse. Even with 
complete freedom to adjust several parameters, 
these models never come close to matching the 
data. While our detailed comparisons are based 
on models in which the formation rates of clusters 
are constant, our main conclusion, that the dis- 
ruption rates do not depend on mass, is valid even 
if the formation rates are variable. As a check on 
these results, we have also rederived the luminos- 
ity function of the Antennae clusters. After cor- 
rections for interstellar extinction, we find a good 
fit to a pure power law, (j>{L) oc L", with a « —2. 



The fact that the mass and luminosity functions 
are nearly identical power laws (a ~ /?) is a conse- 
quence of — and additional support for — the weak 
or nonexistent correlations between masses and 
ages, i.e., the decomposition (/(M, r) oc tp{M)x{T). 
We have also investigated recent claims that there 
arc bends or other features in the mass and lumi- 
nosity functions of the Antennae clusters (Fritze- 
von Alvensleben 1999; Mengel et al. 2005; Anders 
et al. 2007). We show that these features are ar- 
tifacts of the ways the samples are defined and do 
not reflect physical processes involved in the for- 
mation and disruption of the clusters. 

In an efi^ort to understand the physical basis for 
our simple — but doubtless approximate — model 
for g(M, t), we first note that the resulting mass 
function, tjj{M) oc M~^, resembles that of molec- 
ular clouds in nearby galaxies. This indicates that 
the efficiency of star formation in the clouds is 
roughly independent of their masses. We also 
consider a variety of processes that could disrupt 
the protoclusters and clusters: removal of ISM by 
stellar feedback, continued stellar mass loss, tidal 
disturbances by passing molecular clouds, grav- 
itational shocks during rapid passages near the 
galactic bulge and through the galactic disk, or- 
bital decay into the galactic center caused by dy- 
namical friction, and stellar escape driven by in- 
ternal two-body relaxation. We suggest that the 
massive young clusters in the Antennae galax- 
ies are disrupted in the following approximate se- 
quence: (1) ISM removal, r < 10'' yr, (2) stellar 
mass loss, 10''yr < r < lO^yr, (3) tidal distur- 
bances, T > 10^ yr. We have argued on theoret- 
ical grounds that these processes would operate 
at rates roughly independent of the masses of the 
clusters, consistent with our empirically-based de- 
composition g(M,T) oc '4){M)x{t) (which, how- 
ever, is only well established for r < 10^ yr). 
In combination, these processes plausibly account 
for the observed decline of the age distribution, 
X(t) oc t~^. In the longer term, after 12 Gyr or 
so, the escape of stars driven by two-body relax- 
ation will preferentially disrupt low-mass clusters 
and imprint a peak or turnover in the evolved mass 
function at Mp w (1 — 2) x lO^M©, similar to that 
for old globular clusters in the Milky Way and 
other galaxies. 

How general is this picture? In the Intro- 
duction, we mentioned some circumstantial evi- 
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dence in favor of its wide applicability: the statis- 
tics of embedded clusters in the solar neighbor- 
hood (Lada & Lada 2003), the luminosities of 
the brightest clusters in different galaxies (Larsen 
2002; Whitmore 2003; WCF07), the similarity of 
the mass spectra of molecular clouds in different 
galaxies (Blitz ct al. 2007), and the fact that 
the early disruption of protoclusters and clusters 
(r < 10^ yr) is driven mainly by internal pro- 
cesses that depend weakly, if at all, on the proper- 
ties of their host galaxies (§4 here). Furthermore, 
from a detailed study of the mass-age distribu- 
tions of clusters in the LMC and SMC, we find 
results nearly identical to those presented here for 
the clusters in the Antennae (CFW09). This is a 
strong test because the Antennae and the Magel- 
lanic Clouds represent very different environments 
for star and cluster formation: two large inter- 
acting galaxies on one hand, two small, relatively 
quiescent galaxies on the other hand. Neverthe- 
less, further tests of this picture in other galaxies 
would be beneficial. If this picture turns out to be 
generally valid, it will mean that the main differ- 
ence between populations of young clusters is sim- 
ply in the normalization of g(Af, t) and hence in 
the overall formation rate. From this perspective, 
the objects traditionally designated open, popu- 
lous, globular, or super clusters would belong to a 
continuum, with mass and age as the most funda- 
mental variables, and would not require different 
formation and/or disruption processes to account 
for their observed properties. 

The picture presented here has the virtues of 
simplicity and possible universality. It captures 
the salient properties of the observed mass-age dis- 
tribution, at least for the populations of clusters 
we have studied so far. Nevertheless, we do not 
expect our model g{M,T) to be strictly univer- 
sal. It may break down outside its domain of va- 
lidity in most galaxies or inside this domain in 
some extreme environments, such as those dom- 
inated by population III stars. There may also 
be minor deviations from the model in some nor- 
mal galaxies, even within the domain of valid- 
ity, caused for example by variations in the for- 
mation rates and/or differences in the disruption 
rates by passing molecular clouds. However, these 
are only likely to affect g{M, r) at intermediate 
ages (lO^yr ^ r < lO^yr), which will be diffi- 
cult to detect observationally. We see an anal- 



ogy between, on one hand, the potentially uni- 
versal stellar IMF, represented by a simple, ap- 
proximate formula (first by Salpeter (1955) and 
then revised in subsequent studies based on more 
modern data) and, on the other hand, the poten- 
tially universal mass- age distribution g{M, r) for 
star clusters, represented here by a simple, approx- 
imate formula. Both the stellar IMF and the clus- 
ter g{M, t) are thought to be the outcome of sev- 
eral complex physical processes, in ways not yet 
fully understood, and neither is expected to hold 
exactly in all situations. Even so, these functions 
provide compact and useful summaries of the ob- 
servations and are thus natural focal points for fu- 
ture studies of the formation and early evolution 
of stars and clusters. 
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A. COMPARISON WITH OTHER WORKS 



The mass and hiininosity functions of star clusters in the Antennae galaxies derived in several other 
studies differ from the simple power laws presented here. However, as we demonstrate in this appendix, the 
claimed peaks, bends, and other deviations from power laws all result from incompleteness in the samples of 
clusters rather than from any physical processes involved in their formation or disruption. Before we discuss 
any specific claims for features in the mass and luminosity functions, however, it is worth considering the 
credibility of such claims in general. Features that appear near the luminosity limit or detection threshold 
should be considered with skepticism, as they are almost always due to incompleteness. Another common 
problem is selection criteria that fail to account for the fading of the clusters with age, or that are linked in 
some way to measurement errors. Finally, we note that it is inherently much more likely that selection effects 
or measurement errors would convert a simple underlying function like a power law into a more complicated 
one with extra features than the other way around. 

These concerns are well founded in practice. Based on observations taken with the WF/PCl on HST 
(before the spherical aberration was corrected), Fritze-von Alvensleben (1999) claimed that the mass function 
of the young clusters in the Antennae galaxies had a lognormal form, with a peak at M ~ IQ^Mq, like that 
of old globular clusters. However, the purported peak was near the detection limit, and much deeper 
observations taken with the WFPC2 (after the correction of spherical aberration), showed that it was simply 
an artifact of incompleteness, and that the mass function was in fact well represented by a power law, 
V'(M) oc M-2, over the full range of masses, IO^Mq <M< IQ^Mq (ZF99 and §3.2 here). 

Mengel et al. (2005) have presented a mass function for the Antennae clusters that exhibits a gradual 

bend at M '--^ 10^ M©. Their result, derived from ground-based K-hand observations, is reproduced here 
as the open triangles in the top panel of Figure 15. This is based on a sample that is limited at a constant 
luminosity with no restriction on age, instead of being limited at a variable luminosity, corresponding to a 
constant mass, and restricted to relatively narrow intervals of age, as in our procedure. Because the clusters 
fade with age, the Mengel et al. sample does not include the same range of masses at all ages; older, low-mass 
clusters will be missed as they fade below the luminosity limit. We illustrate this in the bottom panel of 
Figure 15, which shows a mass function constructed from our own sample but with a selection procedure 
like that of Mengel et al. Here, we have included clusters of all ages brighter than K\i-^ = 19 (open circles), 
where we have estimated the iif-band magnitude of each cluster from the observed V-band magnitude and 
the V — K color predicted by the stellar population models of Bruzual & Chariot (2003) for the age derived 
from our UBVIHa observations. Evidently, this selection procedure introduces a bend in the output mass 
functions similar to the one found by Mengel et al., even though the input mass function is a power law (see 
§3.2 and Fig. 3). We have also performed experiments with luminosities limited in the V and other bands 
and with gradual and sharp cutoffs, and we always recover a similar, artificial bend in the mass function. 
Mengel et al. (2005) themselves appear to be aware of this problem and urge caution in interpreting the 
bend in their mass function as a real feature. 

Anders et al. (2007) have derived a luminosity function of Antennae clusters with a peak near My « —8.5, 
based on the same WFPC2 observations used here but a very different selection procedure. In particular, 
they include only clusters with photometric errors < 0.2 mag in UBVI, thereby excluding many real clusters. 
Since most of the missing clusters are faint, the luminosity function derived in this way will fall below the 
true one at low luminosities, potentially introducing an artificial peak. To test for this, we have repeated 
the steps in the Anders et al. selection procedure, but using our own sample of clusters, for which the input 
luminosity function is known to be a power law (see §3.2 and Fig. 4). In this case, the output luminosity 
function resembles the one presented by Anders et al., with strong deviations from a power law, resulting 
primarily from the elimination of clusters with large uncertainties in the t/-band photometry. Thus, we 
conclude that the purported peak is an artifact of this selection bias. A forthcoming study of the Antennae 
clusters based on deeper, higher-resolution observations taken with the ACS on HST confirms and extends 
the results presented in this paper, finding a luminosity function that is well represented by a pure power 
law with a « — 2 down to at least My « —7, with no evidence for a break or flattening near My « —8.5 
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(Whitmore et al. 2009). 



B. FORMULAE FOR g{M) and ^(t) 

Here, we present formulae for g{M) and .g(T), the averages of g{M,T) over age (ti < t < T2) and mass 
(Ml < M < M2), respectively, for the three disruption models discussed in Section 2. The equations that 
follow were derived from equations (6) and (12)-(15). The function Md(r) in equations (B3) and (B4) is 
the inverse of the function Td{M) in equations (Bl) and (B2). The formulae for g{M) and g(r) arc valid 
for the stated values of the exponents k and 7 in the disruption models. They are valid for all values of the 
exponent of the initial mass function except /3 = — 1, which is far enough from the observed value /? w — 2 
to be irrelevant in the present context. 

Model 1: Sudden Mass- Dependent Disruption. 



for Td{M) < Ti 
g{M) = C{M/M^Y { R{M) for n < Td{M) < t2 

1 for T2 < Td{M) 



R{M) = 



Td{M) - Ti 
T2 — Tl 



1 for Mrf(T) < Ml 
gij) = Ci { S{t) for Ml < Md{T) < M2 
for M2 < Md(r) 



5(r) = 



M^+^ - J\Id(7 



C 



M, 



Mt^ - Mf +1 

6+1 



M2 



J + lJ \M2-M1 
Model 2: Gradual Mass-Dependent Disruption. 

g{M) = C2{M/M,f+>' [T(M,T2) - r(M,Ti) 



Ml 
M* 



6+1 



for k^O 



(Bl) 
(B2) 
(B3) 

(B4) 
(B5) 

(B6) 



T{M,t) = [l + fc(M/M*)-'^(r/r*)] 



(/3+i)/fc 



(B7) 



C2 



C 



g{M) = Cz{M/M^f for fc = 



(B8) 
(B9) 



C 



cxp 



exp 



(3+1 J \T2-T1 

5(t) =Ciexpp+l)(T/T*)] for A; = 



(BIO) 
(Bll) 
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Model 3: Gradual Mass-Independent Disruption. 

g{M) = CA{M/M^f for 7^-1 



C4 



C 



7+1" 



7 + 1 / V "^2 - Ti , 

g{M) = C5{M/M^f for 7 = -1 

ff(T) =Ci(T/r*)T for all 7 



(B12) 

(B13) 
(B14) 

(B15) 
(B16) 
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Fig. 1. — Luminosity- age plane for the Antennae 
clusters. The luminosities are in the V band and 
have been corrected for extinction. The dashed 
diagonal line is the evolutionary track of a model 
cluster with an initial mass of M = 2 x 10^ Mq 
(Bruzual & Chariot 2003), while the solid hori- 
zontal line at L = 3 X 10^ Lq is the approximate 
upper limit for stellar contamination. The vertical 
gap in the data points just above r = 1 x lO'' yr is 
an artifact caused by loops in the color tracks of 
stellar population models during the RSG phase 
(see the text). 
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Fig. 2. — Mass- age plane for the Antennae clus- 
ters. This is an alternative, but equivalent, repre- 
sentation of the data plotted in Fig. 1 . The lines in 
the two diagrams are also equivalent: the dashed 
horizontal one indicates M = 2 x 10^ Mq, while 
the solid diagonal one indicates L = 3 x 10^ Lq, 
the approximate upper limit for stellar contamina- 
tion. The vertical gap in the data points just above 
r = 1 X 10^ yr is an artifact caused by loops in the 
color tracks of stellar population models during 
the RSG phase (see the text). 
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Fig. 3. — Mass function of Antennae clusters in 
the indicated intervals of age. The lines are power 
laws, dN/dM oc , with best-fit exponents (5 = 
-2.14±0.03 (dashed) and /? = -2.03±0.07 (solid) 
for the younger and older clusters, respectively. 
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Fig. 4. — Luminosity function of Antennae clus- 
ters in the V band, with and without correc- 
tions for extinction (as indicated). The lines are 
power laws, dN/dL oc L", with best-fit exponents 
a = -2.09 ± 0.05 (dashed) and a = -2.37 ± 0.06 
(solid) for the corrected and uncorrected luminosi- 
ties, respectively. 
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Fig. 5. Mass-age plane for a simulated popu- 
lation of clusters based on Model 1. This has a 
power-law initial mass function, a constant forma- 
tion rate, and sudden mass-dependent disruption 
with T* = 5 X 10'' yr and k = 0.62 in equation (7). 
The solid line is the approximate upper limit for 
stellar contamination (L = 3 x lO'''iy0), while the 
dashed line indicates the age at which clusters are 
destroyed. The number of clusters in the simu- 
lated population above the solid line is approxi- 
mately the same as in the real Antennae popu- 
lation. The region marked D here is empty, in 
contrast to the same region of Fig. 2. 
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Fig. 6. — Mass distribution averaged over the in- 
dicated intervals of age for the Antennae clusters 
(data points) and for Model 1 (lines). In this 
model, clusters have a power-law initial mass func- 
tion, a constant formation rate, and sudden mass- 
dependent disruption. The dashed and solid lines 
were computed for the younger and older clusters, 
respectively, from equations (7), (Bl), and (B2) 
with T* = 5 X 10'^ yr and k = 0.62. 
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Fig. 7. Age distribution averaged over the; in- 
dicated intervals of mass for the Antennae clus- 
ters (data points) and for Model 1 (lines). In this 
model, clusters have a power-law initial mass func- 
tion, a constant formation rate, and sudden mass- 
dependent disruption. The dashed and solid lines 
were computed for less massive and more mas- 
sive clusters, respectively, from equations (7) and 
(B3)-(B5) with r* = 5 x 10^ yr and k = 0.62. 
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Fig. 8. — Mass distribution averaged over the 
indicated intervals of age for the Antennae clus- 
ters (data points) and for Model 2 with k = 
(lines). In this model, clusters have a power- 
law initial mass function, a constant formation 
rate, and gradual mass-dependent disruption. The 
solid lines were computed for the older clusters 
only from equations (B9) and (BIO) with the in- 
dicated values of . The dashed line, with = 
1.4 X 10'' yr, provides a good fit to the observed 
distribution for the older clusters. 
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Fig. 9. — Mass distribution averaged over the 
indicated intervals of age for the Antennae clus- 
ters (data points) and for Model 2 with k = 0.6 
(lines). In this model, clusters have a power-law 
initial mass function, a constant formation rate, 
and gradual mass-dependent disruption. The solid 
lines were computed for the older clusters only 
from equations (B6)-(B8) with the indicated val- 
ues of r*. 
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Fig. 10. — Mass distribution averaged over the 
indicated intervals of age for the Antennae clus- 
ters (data points) and for Model 2 with k = 1.0 
(lines). In tliis model, clusters have a power-law 
initial mass function, a constant formation rate, 
and gradual mass-dependent disruption. The solid 
lines were computed for the older clusters only 
from equations (B6)-(B8) with the indicated val- 
ues of r*. 
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Fig. 11. — Age distribution averaged over the in- 
dicated interval of mass for the Antennae clus- 
ters (data points) and for Model 2 with k = 
(lines). In this model, clusters have a power- 
law initial mass function, a constant formation 
rate, and gradual mass-dcpondcnt disruption. The 
solid lines were computed from equations (B5) and 
(Bll) with the indicated values of r*. The dashed 
line hero corresponds to the dashed line in Fig. 8 
(both with r* = 1.4 X lO'^ yr). 



28 




log (M/M<^ 

Fig. 12. — Mass distribution averaged over the 
indicated intervals of age for the Antennae clus- 
ters (data points) and for Model 3 with 7 = —1.0 
(lines). In this model, clusters have a power- 
law initial mass function, a constant formation 
rate, and gradual mass- independent disruption. 
The dashed and solid lines were computed for 
the younger and older clusters, respectively, from 
equations (B14) and (B15). 
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Fig. 13. — Age distribution averaged over the in- 
dicated intervals of mass for the Antennae clus- 
ters (data points) and for Model 3 with 7 = —1.0 
(lines). In this model, clusters have a power-law 
initial mass function, a constant formation rate, 
and gradual mass- independent disruption. The 
dashed and solid lines were computed for the less 
massive and more massive clusters, respectively, 
from equations (B5) and (B16). 
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Fig. 14. — Mass functions of young clusters in the 
Antennae and LMC (upper panel) and old globu- 
lar clusters in the Milky Way and Sombrero (lower 
panel), as described in the text. The smooth 
curves in both panels are from the simple evapora- 
tion model, cqn. (18) with ^ev = 2 x lO~^M0yr~-^, 
at ages t = 0, 1.5, 3, 6, and 12 Gyr. Note the ex- 
cellent fit of the model to the data at both r = 
(upper panel) and r = 12 Gyr (lower panel). 
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Fig. 15. — Upper Panel: Mass function of Anten- 
nae clusters from this work (filled circles) and from 
Mengel et al. (2005) (open triangles). The former 
is based on an age-restricted subsample (r = 1- 
10 Myr); the latter is based on a subsample limited 
in iiTg-band luminosity but with no restriction on 
age. The vertical dashed line is the stated 50% 
completeness limit for the Mengel et al. sample 
{Ks « 18.4). Lower Panel: Mass function of An- 
tennae clusters from this work (filled circles) and 
for a subsample with a luminosity limit similar to 
that of Mengel et al., based on Ks magnitudes 
predicted from stellar population models and the 
observed V magnitudes (open circles). 
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